Tab. 1: Human endogenous retroviral elements

HERV family                      Copy number       % of genome

Class I HERVs (type C-related HERVs)
HERV-ERI                                           0.07 %
HERV-E (4-1, ERVA, NP-2)         35-50
HERV-E LTR                       500-600
51-1                             35-50
ERV1                             10-15
HERV-R (ERV3)                    10
RRHERV-I                         20
HERV-T (S71, CRTK1, CRTK6)       50-60
HERV-T LTR                       150-200
ERV-FRD                          8
HERV-RW                                            0.2 %
HERV-W (MSRV)                    25-50
HERV-R (ERV9)                    30-40
ERV9 LTR                         3000-4000
HERV-P (HuERS-P, HuRRS-P)        50-90             0.01 %
HERV-IP                                            0.01 %
HERV-I (RTVL-I)                  25-50
HERV-IP-T47D (ERV-FTD)           35
HERV-IP LTR                      1800-2000
HERV-HF                                            0.2 %
HERV-H (RTVL-H, RGH)             900-1000
HERV-F                           16
HERV-H-LTR                       1000

Class II HERVs (type A-, B-, and D-related HERVs)
HERV-K                                             0.5 %
HERV-K(HML-1)                    10-20
HERV-K(HML-2)                    30-50
HERV-K10
HERV-K-HTDV
HERV-K-IDDM
HERV-K(HML-3)                    25
HERV-K(HML-4)                    6
HERV-K-T47D
HERV-K(HML-6)                    30-40
HERV-K-HML-6p
HERV-K-HML-6.17
HERV-K(HML-7)                    ?
HERV-K-NMW7
HERV-K(HML-8)                    ?
HERV-K(HML-9)                    ?
HERV-K-NMW9
HERV-K(HML-10)                   10-50
HERV-KC4
HERV-K LTR                       10 000-25 000

Foamy virus-related HERVs
HERV-L                           100-200           0.02 %
Tab. 2: Primers used for the amplification of different HERV-LTR regions

No.   HERV family    Sequence
34    HERV-K         ATGGCGGTTTTGTCGAA
35    HERV-K         GTTCCMTYAGTATTTATTGATC
36    HERV-K         ATGGAGCATACAATCGGG
3     HERV-K         AAGAAAAGGGGGAAATGTGGG
11    HERV-KC4       AAAGGGAGGGGGGCATG
12    HERV-K-T47D    TAAAAAGGGGGGAGATG
1     HERV-H         ATGTGAGCAACATGGCTGTTTATTTC
2     HERV-H         TGTCAGGCCTCTGAGCCCAA
39    HERV-H         GCCATCTCGAGTGTCAGSCCTCTGAGYCYARGC
37    HERV-H         TATCTTGAATTCGKGTGAGCAAYAARRCTTTA
31    polydT         TTTTTTTTTTTTTTTTT
17    HERV-E         AAAGGGGGGGAAATATG
18    HERV-L         AGGGGTGGGACTTGCGATG
19    HERV-W         TGTTGAGATGGGGGACTGAG
20*   HERV-W         GCAGTTGCAAGATTTAATAGAG
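Several primers in Tab. 2 contain IUPAC ambiguity codes (M, Y, S, K, R), and reverse primers are listed as the reverse complement of the LTR strand shown in the Appendix. The following is a minimal sketch, assuming the standard IUPAC nucleotide codes, of how a degenerate primer can be checked against a template sequence. The helper names are illustrative only and are not part of the original work.

```python
import re

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Complement of every code, e.g. R (A/G) pairs with Y (T/C).
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a (possibly degenerate) sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def primer_regex(primer: str) -> re.Pattern:
    """Translate a degenerate primer into a regular expression."""
    parts = []
    for base in primer.upper():
        choices = IUPAC[base]
        parts.append(choices if len(choices) == 1 else f"[{choices}]")
    return re.compile("".join(parts))


def primer_matches(primer: str, template: str) -> bool:
    """True if the primer matches anywhere on the given template strand."""
    return primer_regex(primer).search(template.upper()) is not None


# Primer 20 (reverse primer) against the 3' end of a HERV-W LTR from Appendix B.
primer_20 = "GCAGTTGCAAGATTTAATAGAG"
w_ltr_end = "CTGTTTTCACTCTATTAAATCTTGCAACTGC"
print(primer_matches(reverse_complement(primer_20), w_ltr_end))

# Portion of degenerate primer 39 starting at TGTCAG, against the start of a
# HERV-H LTR from Appendix A.
primer_39_part = "TGTCAGSCCTCTGAGYCYARGC"
h_ltr_start = "TGTCAGGCCTCTGAGCCCAAGCTAAGCCAT"
print(primer_matches(primer_39_part, h_ltr_start))
```

The sketch only tests for exact matches under the ambiguity codes; it does not model mismatches, melting temperature, or primer tails.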





Tab. 3: Analyzed HERV-LTRs
A: HERV-LTRs from different cell lines and tissues

LTR                     Primer   Origin        Homology
HERV-K2                 34/36    T47-D         HERV-K10, M12854, 97.9 %
HERV-K3                 34/36    T47-D         HERV-K10, M12854, 98.4 %
HERV-K22-K32-K27-K45    34/36    brain         HERV-K10, M12854, 98.6 % *
HERV-K30                3/31     heart         HERV-K10, M12854, 97.6 %
HERV-K-T47D-L5                   T47-D         MSRV, AF127229
HERV-K-T47D-L50                  T47-D         MSRV, AF127229
HERV-K-T47D-L8                   T47-D         MSRV, AF127229
HERV-K-T47D-L9                   T47-D         MSRV, AF127229
HERV-K-T47D-L48                  T47-D         MSRV, AF127229
HERV-K-T47D-L20                  T47-D         MSRV, AF127229
HERV-IP-T47D                     T47-D         MSRV, AF127229
HERV-T47D-W2            19/20    T47-D         MSRV, AF127229
HERV-T47D-W4            19/20    T47-D         MSRV, AF127229
HERV-T47D-W5            19/20    T47-D         MSRV, AF127229
HERV-W1                 19/20    T47-D         MSRV, AF127229
HERV-W10                19/20    T47-D         MSRV, AF127229
HERV-W11                19/20    T47-D         MSRV, AF127229
HERV-W18                19/20    T47-D         MSRV, AF127229
HERV-W2                 19/20    T47-D         MSRV, AF127229
HERV-W22                19/20    T47-D         MSRV, AF127229
HERV-W23                19/20    T47-D         MSRV, AF127229
HERV-W4                 19/20    T47-D         MSRV, AF127229
HERV-W5                 19/20    T47-D         MSRV, AF127229
HERV-W6                 19/20    T47-D         MSRV, AF127229
HERV-W8                 19/20    T47-D         MSRV, AF127229
HERV-H1                 1/2      H9            Cercopithecus aethiops ERV-H, U96012, 87.1 %
HERV-H8                 1/2      HUT           HERV-H LTR18106, 84.8 %
HERV-H13                1/2      HUT           HERV-H LTR18106, 91.8 %
HERV-H19                1/2      liver         Callithrix jacchus ERV-H, 5'LTR, U96052, 92.1 %
HERV-H31                1/2      liver         HERV-H(H6) X12717, 99.8 %
HERV-H3                 1/31     85HG66        HERV-H(H6) X12717, 100 %
HERV-H CL1              1/2      Chang Liver   HERV-H(H6) X12717, 100 %
HERV-H CL2              1/2      Chang Liver   HERV-H LTR18106, 84 %
HERV-H CL3              1/2      Chang Liver   Callithrix jacchus ERV-H, 5'LTR Silva 5, U96057, 84.2 %
HERV-H CL4              1/2      Chang Liver   HERV-H(H6) X12717, 100 %
HERV-H PA7              1/2      Panc1         Callithrix jacchus ERV-H, 5'LTR Silva 4, U96062, 85.7 %
HERV-H PA8              1/2      Panc1         Cercopithecus aethiops ERV-H, Vero 22, U96012, 87.1 %
HERV-H PA9              1/2      Panc1         HERV-H LTR18106, 85 %
HERV-H PA10             1/2      Panc1         Callithrix jacchus ERV-H, 5'LTR Silva 4, U96062, 85.6 %
HERV-H MC14             1/2      MCF7          Cercopithecus aethiops ERV-H, Vero 22, U96012, 86.6 %
HERV-H MC15             1/2      MCF7          Cercopithecus aethiops ERV-H, U96012, 86.6 %
HERV-H MC16             1/2      MCF7          Callithrix jacchus ERV-H, 5'LTR Silva 4, U96062, 87.4 %
HERV-H MC17             1/2      MCF7          Cercopithecus aethiops ERV-H, Vero 22, U96012, 86.6 %
HERV-H MP20             1/2      MiaPaca       Human beta globin retrovirus-like repetitive element, K01891, 92.8 %
HERV-H MP21             1/2      MiaPaca       HERV-H LTR18106, 89.2 %
HERV-H MP23             1/2      MiaPaca       HERV-H(H6) X12717, 99.5 %

B: HERV-LTRs published in the literature

LTR              Length (bp)   Reference
HERV-K-pl167     970           Leib-Mösch et al., 1993
HERV-K-T47-D     1200          Seifarth et al., 1998
HERV-H-H6        393           Feuchter and Mager, 1990
HERV-T-S71A      625           Murr, Dissertation, 1998
HERV-E           391           Steele et al., 1984
HERV-L           462           Cordonnier et al., 1995

Fig. 2g: Relative promoter activity of different HERV-LTRs in different cell lines
[Figure not reproduced. Panels: HERV-K-T47D, HERV-K-PI165, HERV-L, HERV-E, HERV-H-H6, HERV-T; x-axis: cell line.]

Fig. 4: LTR-R region modulates promoter activity of HERV-K-T47D related LTRs

Fig. 7: Retroviral ProCon vectors pLESN-MMTV and pLESN-HERV-H

Fig. 8: Retroviral ProCon vectors pLX-MMTV and pLX-HERV-H

Fig. 9: a) Promoter conversion of the hybrid ProCon vectors
b) Demonstration of the correct promoter conversion with PCR and hybridization with a HERV-H and a psi probe (1: CK; 2: CK-pLX-MMTV; 3: CK-pLX-HERV-H)

Fig. 10: a) Organization of the two ProCon vectors pLX-MMTV and pLX-HERV-H
b) Promoter activity of the HERV-H LTR in comparison to the MMTV-LTR after infection of CrFK cells

Appendix

A. HERV-H LTR sequences 
              1                                                   50
HERV-H L31    TGTCAGGCCT CTGAGCCCAA GCTAAGCCAT CATATCCCCT GTGACCTGCA
HERV-H HCM2   TGTCAGGCCT CTGAGCCCAA GCCAGGCCAT CGCATCCCCT GTGACTTGCA
HERV-H 19     TGTCAGGCCT CTGAGCCCAA GCTAAGCCAT CATATCCCCT GCGACCTGCA
HERV-H MP20   TGTCAGGCCT CTGAGCCCAA GCTAAGCCAT CATATCCCCT GTGACCTGCA
HERV-H CM3    TGTCAGGCCT CTGAGCCCAA GCCAAGCCAT CGCATCCCCT GTGACTTGCA
HERV-H MC16   TGTCAGGCCT CTGAGCCCAA GCC TGCA
HERV-H CM1    TGTCAGGCCT CTGAGCCCAA GCCAAGCCAT CGCATCCCCT GTGACTTGCA
HERV-H MP23   TGTCAGGCCT CTGAGCCCAA GCCAAGCCAT CGCATCCCCT GTGACTTGCA
HERV-H H13    TGTCAGGCCT CTGAGCCCAA GCTAAGCCAT CATATCCCCA GGGACCTGCA
HERV-H H1     TGTCAGGCCT CTGAGCCCAA GCTAAGCCAT CAAATCCCCT GTGACCTGCA
HERV-H HU8    TGTCAGGCCT CTGAGCCCAA GCTAAGCCAT CATATCCC.. GTGACCTGCA
HERV-H PA7    TGTCAGGCCT CTGAGCCCAA GCTAAGCCAT CAAATCCCCT GTGACCTACA









              51                                                 100
HERV-H L31    CGTATACATC CAGATAGCC. TGAAG CAACTG....
HERV-H HCM2   CGTATACATC CAGATGGCC. TAAAG TAACTGAAGA TCCA
HERV-H 19     CATATACATC CAGATGGCC. TGAAG TAACTGAAGA ATCA
HERV-H MP20   CGTACACATC CAGATGGCCG GTTCCTGCCT TAACTGATGA CATTCCACCA
HERV-H CM3    CGTGTATGCC CAGATGGCC. TGAAG TAACTGAAGA ATCA
HERV-H MC16   CGTATACATC CAGATG.... AAG CAAGTGAAGA ATCA
HERV-H CM1    CGTATACGCC CAGATGGCC. TGAAG TAACTGAAGA ATCA
HERV-H MP23   CGTATACGCC CAGATGGCC. TGAAG TAACTGAAGA ATCA
HERV-H H13    CGTATACATC CAGATGGCC. TGAAG CAACTGAAGA TCCA
HERV-H H1     GGTGTACATC CAGATGACC. TGAAG CAACTGAAGA TCCA
HERV-H HU8    ..TATACATC CAGATGGCC. TGAAG CAACTGAAGA TCCA
HERV-H PA7    CGTGTACATC CAGATGACC. .....TGAAG CAACTGAAGA TCCA








101 








150 



CAAAAGAAGT 
CAAAAGAAGT 
CGAAAGAAGT 
CAAAAGAAGT 
CAAAAGAAGT 
CAAAAGAACT 
CAAAAGAACT 
CAAAGGAAGT 
CAAAAGAAGT 
CAAAGGAAGT 
CAAAAGAAGT 



AAAAATATCC 
AAAAACAGCC 
GAAAATGGCC 
GAAAATGACC 
GAAAAGGCCC 
GAAAATGGCC 
GAAAAGGCCC 
GAAAAGGCCC 
GAAAATAGCC 
GAAAGTAGCC 
GAAAATAGCC 
GAAAGTAGCC 



TTAACTGATG 
TTAACTGATG 
TGTTCC .... 
TGTTCG .... 

TGCCCC 

GGTTCC .... 
TGCCCCGCCT 
TGCCCCGCCT 
TTAACTGATG 
TTAACTGATG 
TTAACTGATG 
TTAACTGATG 



ACA. 
ACA. 



. TTCCAATA 
. TTCCAACA 



TAACTGATGA CATTCCACCA 
TAACTGATGA CATTCCACCA 

ACA TTCCACCA 

ACA TTCCACCA 

ACA TTCCACCA 

ACA TTCCACCA 



151 

TTGTGATTTG TTTCTGCCCT ACCCTGACTG 
TTGTGATTTG TTCCTGCCCC ACCCTAACTG 

T GCCTTAACTG 

T GCCTTAACTG 

ACCTTAACTG 

T GCCTTAACTG 

TGGTGATTTG TTCTTGCCCC ACCTTAACTG 
TGGTGATTTG TTCTTGCCCC ACCTTAACTG 
TTGTGATTTG TTTCTGCCCC ATCCTAACTG 
TTGTGATTTG TTCCTGCCCC ACGCTAACTG 
TTGTGATTTG TTCCTGCCCC ACGCTAACTG 
TTGTGATTTG TTCCTGCCCT ACGCTAGCTG 



201 

CCCCCACCCT 
CCCCCACCCT 
TCCTTCTCCT 
TCCTCCTCCT 
TCCTTCCCCT 
TCCTTCTCCT 



TCAGAAGGCT CTTTGTAATC 
TAAGAAGGTC CTTTGTAATT 
GGCTCATCCT GGCTCAAAAG 
GGCTCATCCT GGCTCAAAAG 

GGCTCAG AAG 

GGCTCAG AAG 



ATCAATGTGC 
ATAAATGTAC 
ATGACATTAC 
ATGACATTGT 
AGTGATTAAC 
ATGACATTAC 
AGTGATTAAC 
AGTGATTAAC 
ATCAATGTAC 

AT AC 

AT AC 

AT AC 



CTCCCCACCC 
CTCCCCACCC 
CTC . . CCGCA 
CTC. .CCCGA 
CTC. .CCCCA 
CTC . . CCCCA 



200 

TTTGTAATCT 
TTTGTAATCT 
CTTGTGAAAT 
CTTGTGAAAT 
CCCATGAATT 
CTTGTGAAAT 
CCTGTGAATT 
CCTGTGAATT 
TTTGTAATCT 
CATATATTCT 
CATATATTCT 
CATATATTCT 

250 

TTGAGAATGG 
TTGAGAGTGT 
CTGAG . . . . C 
CTGAG . . . . T 
CTGAG. . . .C 
CTGAG . . . . C 



HERV-H CM1 
HERV-H MP23 
HERV-H H13 
HERV-H Hi 
HERV-H HU8 
HERV-H PA7 
TGCTTCTCCT GGCTCAG AAG CTC..CCCCA

TGCTTCTCCT GGCTCAG AAG CTC . . CCCCA 

CTCCCACCCT TAAGAAGGTT CTTTGTAATT CTCCCCACCC 

TCCCC CGCCC 

TCCCC CGCCC 

TCCCC CGCCC 



CTGAG . . . . C 
CTGAG . . . . C 
TTGAGAGTGT 
TTGAGAATGT 
TTGAGAATGT 
TTGAGAATGT 



              251                                                300
HERV-H L31    ACTTGGTGAG ATCCACCCCC TGCCTGCAAA GCATTGCCCC TAACTCCACC
HERV-H HCM2   ACTTTGTGAG ATCCACACCT GCCCACCAGA GAACAAACCC CCTTTGACTG
HERV-H 19     ACCTTGTGAC CCCTGCCTCT GCCCGCCAGA GAGCAACCCC CTCTTGACTG
HERV-H MP20   ACATTGTGAC CCCCACTCCT GCCCGCCAGA GAACAGCCCC CT.TTGACTG
HERV-H CM3    ACCTTGTGAC CCCTGCCCCT GCCCACCAGA GAACAACCCC CT.TTGACTG
HERV-H MC16   ACCTTGTGAC CCCCACTCCT CCCCGCCACA GAACAACCCC CT.TTGACTG
HERV-H CM1    ACCTTGTGAC CCCCGCCCCT GCCCACCAGA GAACAACCCC CT.TTGACTG
HERV-H MP23   ACCTTGTGAC CCCCGCCCCT GCCCACCAGA GAACAGACCC CT.TTGACTG
HERV-H H13    ACTTTGTGAG ATCCACCCCC TGCCGGCAAA ACATTGCTCC TAACCCAACC
HERV-H H1     ACTTTGTA C
HERV-H HU8    ACTTTGTA C
HERV-H PA7    ACTTTGTA C

              301                                                350
HERV-H L31    GCCTGTCCCA AAACCTATGA GAA.CTAATG ATA ATCCC.ACCA
HERV-H HCM2   TAATTTTCCA TTACCTTCCC TAATCCTATA AAACGGCCCC ACCCC.ATCT
HERV-H 19     TAATTTTCCT TTACCTACCT .AAATCCTATA AAATGGCCCC ACTCCTATCT
HERV-H MP20   TAATTTTCCT TTATCTACCC AAATCCTATA AAACAGCCCC ACCTTTATCT
HERV-H CM3    TAATTTTCCA TTACTTTCCC AAATCCTATA AAACGGCCCC ACCCCTATCT
HERV-H MC16   TAATTTTCCA CTGCCCGCCC AAACCCTATA AAACGGTCCC ACCCC.ATCT
HERV-H CM1    TAATTTTCCA TTACCTTCCC AAATCCTATA AAACGGCCCC ACCCCTATCT
HERV-H MP23   TAATTTTCCA TTACCTTCCC AAATCCTATA AAACGGCCCC ACCCCTATCT
HERV-H H13    GCCTA.CCCC AAACCTGTAA GAA.CTAATG ATA ATCC..ACCA
HERV-H H1     ACCTATCCC. AAACCTATAA GAA.CTAATG ATA ATCCT.ACCA
HERV-H HU8    ACCTATCCC. AAACCTATAA GAA.CTAATG ATA ATCC.ACCA
HERV-H PA7    ACCTATCCC. AAACCTATAA GAA.CTAATG ATA ATCCT.ACCA

              351                                                400
HERV-H L31    CACTTTGCTG ACTCTCTTTC .AGACTC AGCCCGGCTG CACCCAGGTG
HERV-H HCM2   CCCTTTGCTG ACTCTCTTTT .GGACTC AGCCCGCCTG CACCCAGGTG
HERV-H 19     CCCTTCGCTG ACTCTCTTTT .GGACTC AGCCCGCCTG TACCCAGGTG
HERV-H MP20   CCCTTGGCTG ACTCTCTTTT .GGACTC AGCCCGCCTG CACCCAGGTG
HERV-H CM3    CCCTTCGCTG ACTCTCTTTT .GGACTC AGCCCGCCTG CACCCAGGTG
HERV-H MC16   CCCTTCCCTG ACTCTCTTTT CTTCGGACTC AGCCCGCCTG CACCCAGGTG
HERV-H CM1    CCCTTCGCTG ACTCTCTTTT C...GGACTC AGCCCGCCTG CCCCCAGGTG
HERV-H MP23   CCCTTCGCTG ACTCTCTTTT .GGACTC AGCCCGCCTG CCCCCAGGTG
HERV-H H13    CCCTTTGCTG ACTC..TTTT .AGAATC AGCCCGCCTG CACCCAGGTG
HERV-H H1     CCCTTTGCTG ACTCTCTTTT .GGACTC AGCCCGCCTG CACCCAGGTG
HERV-H HU8    CCCTTTGCTG ACTCTCTTTT .GGACTC AGCCCGCCTG CACCCAGGTG
HERV-H PA7    CCCTTTGCTG ACTCTCTTTT .GGACTC AGCCCGCCTG CACCCAGGTG

              401                    425
HERV-H L31    AAATAAACAG CCATGTTGCT CACAT
HERV-H HCM2   AAATAAACAG CCATGTTGCT CACAT
HERV-H 19     AAATAAACAG CCATGTTGCT CACAT
HERV-H MP20   AAATAAACAG CCATGTTGCT CACAT
HERV-H CM3    AAATAAACAG CCATGTTGCT CACAT
HERV-H MC16   AAATAAACAG CCATGTTGCT CACAT
HERV-H CM1    AAATAAACAG CCATGTTGCT CACAT
HERV-H MP23   AAATAAACAG CCATGTTGCT CACAT
HERV-H H13    AAATAAACAG CCATGTTGCT CACAT
HERV-H H1     AAATAAACAG CCATGTTGCT CACAT
HERV-H HU8    AAATAAACAG CCATGTTGCT CACAT
HERV-H PA7    AAATAAACAG CCATGTTGCT CACAT

B. HERV-W LTR sequences

              1                                                   50
HERV-T47D-W2  TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCGA
HERV-T47D-W4  TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCGA
HERV-T47D-W5  TGTTGAGATG GGGGACTGAG AGACAGGACT ACCTGGATTT CCTAGGCCGA
HERV-W1       TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCAA
HERV-W10      TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTC CCTAGGCCGA
HERV-W11      TGTTGAGATG GGGGACTGAG AGACAGGACT AGTTGGATTT CCTAGGCTGG
HERV-W18      TGTTGAGATG GGGGACTGAG AGACAGGACT AGTTGGATTT CCTAGGCCGG
HERV-W2       TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCAA
HERV-W22      TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCTGA
HERV-W23      TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCTGA
HERV-W4       TGTTGAGATG GGGGACTGAG AGACAGGACT AGTTGGATTT CCTAGGCTGG
HERV-W5       TGTTGAGATG GGGGACTGAG AGACAGGACT ACCTGGATTT CCTAGGCCAA
HERV-W6       TGTTGAGATG GGGGACTGAG AGACAGGACT AGCTGGATTT CCTAGGCCAA
HERV-W8       TGTTGAGATG GGGGACTGAG AAACAGGACT AGCAGGATTT CCTAGGCCGA

              51                                                 100
HERV-T47D-W2  CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACCGCATCC ACCTTTAAAC
HERV-T47D-W4  CTAAGAATTC CTAAGCCTAG CTGGGAAGGT GACCGCATCC ATCTTTAAAC
HERV-T47D-W5  CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACCACATCC ACCTTTAAAC
HERV-W1       CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACTACACCC ACCTTTAAAC
HERV-W10      CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACCACATCC ACCTTTAAAC
HERV-W11      CTAAGAATCC CTAAGCCTAG CTGGGAAATT GACCACGTCC ACCTTTAAAC
HERV-W18      CTAAGAATCC CTAAGCCTAG CTGGGAAATT GACCACGTCC ACCTTTAAAC
HERV-W2       CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACTACACCC ACCTTTAACC
HERV-W22      CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACCGCATCC ATCTTTAAAC
HERV-W23      CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACTACACCC ACCTTTAACC
HERV-W4       CTAAGAATCC CTAAGCCTAG CTGGGAAATT GACCACGTCC ACCTTTAAAC
HERV-W5       CTAAGAATCT CTAAGCCTAG CTGGGAAGGT GACCACATCC ACCTTTAAAC
HERV-W6       CTAAGAATCC CTAAGCCTAG CTGGGAAGGT GACTACACCC ACCTTTAACC
HERV-W8       TTAAGAATCC CTAAGCCTAG ATGGGAAGTT GACCACATCC ACCTTTAAAC

              101                                                150
HERV-T47D-W2  ACGGGGCTCG CAACTTAGCT CACACCCAAC CAATCAGGTA GTAAAGAGGG
HERV-T47D-W4  ATGGGGCTTG CAACTTAACT CATATCTGAC CAATCAGGTA GTAAAGAGAG
HERV-T47D-W5  ACAGGGCTTG CAACTTAGCT CACACTTGAC CAGTCAGGTA GTAAAGAGAG
HERV-W1       ATGGGGCTTG CAACTTAGCT CACACCCAAC CAATCAGGTA GTAAAGAGAG
HERV-W10      ACGGGGCTTG CAACTTAGCT CATACCCAAC AAATCAGGTA GTAAAGAGAG
HERV-W11      ACGGGGCTTG CAATTTAGCT CACACCCGAC CAATCAGGTA GTAAAGGGAG
HERV-W18      ACGGGGCTTG CAATTTAGCT CACACCCGAC CAATCAGGTA GTAAAGGGAG
HERV-W2       ACTAGGCTTG CAACTTAGCT CACACCCGAC CAATCAGGTA GTAAAGAGAG
HERV-W22      ATGGGGCTTG CAACTTAACT CATATCTGAC CAATCAGGTA GTAAAGAGAG
HERV-W23      ACTAGGCTTG CAACTTAGCT CACACCCGAC CAATCAGGTA GTAAAGAGAG
HERV-W4       ACGGGGCTTG CAATTTAGCT CACACCCGAC CAATCAGGTA GTAAAGGGAG
HERV-W5       ACAGGGCTTG CAACTTAGCT CACACCCGAC CCATCAGGTA AGAAAGAGAG
HERV-W6       ACTAGGCTTG CAACTTAGCT CACACCCGAC CAATCAGGTA GTAAAGAGAG
HERV-W8       ACGGGGCTTG CAACTCAGCT CACACCCGAC CCATCAGGTA AGAAAGAGAG

              151                                                200
HERV-T47D-W2  CTCACTAAAA TGCTAATTAG GCAAAGACAG GAGGTAAAGA AATAGCCAAT
HERV-T47D-W4  CTCACTAAAA TGCTAATTAG GCTAAAACAG GAGGCAAAGA AGTAGCCAAT
HERV-T47D-W5  CTCACTAAAA TGCTAATTAG GCTAAAACAG GAGGTAAAGA AATAGACAAT
HERV-W1       CTTGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGCCAGT
HERV-W10      CTCACTAAAA TACTGATTAG GCGAAAACAG GAGGTAAGGA AACAGCCAGT
HERV-W11      CTCACTAAAA TGCTAATTAG GGAAAAACAG GAGGTAAAGA AGTAGCCAAT
HERV-W18      CTCACTAAAA TGCTAATTAG GGAAAAACAG GAGGTAAAGA AGTAGCCAAT
HERV-W2       CTTGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAGAGA AATAGCCAAT
HERV-W22      CTTGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGCCAGT
HERV-W23      CTTGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGCCAGT
HERV-W4       CTCACTAAAA TGCTAATTAG GGAAAAACAG GAGGTAAAGA AGTAGCCAAT
HERV-W5       CCCGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGTCAAT
HERV-W6       CTTGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGCCAGT
HERV-W8       CCCGCTAAAA TGCTAATTAG GCAAAAACAG GAGGTAAAGA AATAGCCAAT

              201                                                250
HERV-T47D-W2  CATCTATTGC CTGAGAGCAC AGCAGGAGGG ACAATGATCG GGATATAAAC
HERV-T47D-W4  CATCTGTTGC CTGACAGCAC AGCAGGAGGG ACAATGATCG GGATATAAAC
HERV-T47D-W5  CATCTATCAC CTGAGAGCAC AGTGGGAGGG ACAATGATCG GCATATAAAC
HERV-W1       CATCTATCGC CTGACAGCAC AAGGGGCGGG ACAATGATCA GGATATAAAC
HERV-W10      CATCTATCGC CTGACAGCAC AAGGGGCGGG ACAATGATCA GGATATAAAC
HERV-W11      CATCTATCGC CTGAGAGCAC AACAGGAGGG ACAATGATCA GGATATAAAC
HERV-W18      CATCTATCGC CTGAGAGCAC AACAGGAGGG ACAATGATCA GGATATAAAC
HERV-W2       CATCTATCGC CTGAGAGCAC AGCAGGAGGG ACAATGATCC GGATATAAAC
HERV-W22      CATCTATCGC CTGACAGCAC AAGGGGCGGG ACAATGATCA GGATATAAAC
HERV-W23      CATCTATCGC CTGACAGCAC AAGGGGCGGG ACAATGATCA GGATATAAAC
HERV-W4       CATCTATCGC CTGAGAGCAC AACAGGAGGG ACAATGATCA GGATATAAAC
HERV-W5       CATCTATTGC CTGAGAGCAC AGCGGGAGGG ACAATGATCA GGATATAAAC
HERV-W6       CATCTATCGC CTGACAGCAC AAGGGGCGGG ACAATGATCA GGATATAAAC
HERV-W8       CATCTATTGC CTGAGAGCAC AGCGGGAGGG ACAATGATCA GGATATAAAC

              251                                                300
HERV-T47D-W2  CCAAGTCTTC GAGCCGGCAA TGGCTACCTT CTTTGGGTCC CCTCCCTTTG
HERV-T47D-W4  CCAGGCATTC GAGCCAGCTA CAGCTACCCT CTTTGGGTCC CCTCCCTTTG
HERV-T47D-W5  CCAGGCATTC GAGCCAGCAA CAGCAACCCC CTTTGGG
HERV-W1       TCAGGCATTC AAGCCAGCAA TGGCTACCCA CTTTGGGTCC CCTCCCATTT
HERV-W10      TCAGGCATTC AAGCCAGCAA TGGCTACCCA CTTTGGGTCC CCTCCCATTT
HERV-W11      CCAGGCATTC AAGCCAGCGG TGGCTACCCT CTTTGGGTCC CCTCCCTTTG
HERV-W18      CCAGGCATTC AAGCCAGCGG TGGCTACCCT CTTTGGGTCC CCTCCCTTTG
HERV-W2       CCAAGCATTC GAGCCAGCAA TGGCTACCCT CTTTGTGTCC CCTCCCTTTG
HERV-W22      TCAGGCATTC AAGCCAGCAA TGGCTACCCA CTTTGGGTCC CCTCCCATTT
HERV-W23      TCAGGCATTC AAGCCAGCAA TGGCTACCCA CTTTGGGTCC CCTCCCATTT
HERV-W4       CCAGGCATTC AAGCCAGCGG TGGCTACCCT CTTTGGGTCC CCTCCCTTTG
HERV-W5       CCAGGCATTC GAGCCGGCAA CGACTACCCT CTTTGGGTCC CCTCCCTTTG
HERV-W6       TCAGGCATTC AAGCCAGCAA TGGCTACCCA CTTTGGGTCC CCTCCCATTT
HERV-W8       CCAGGCATTC GAGCCGGCAA CGACTACCCT CTTTGGGTCC CCTCCCTTTG

              301                                         343
HERV-T47D-W2  TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG C..
HERV-T47D-W4  TATGGGAGCT CTGTCTTCAC TCTATTAAAT CTTGCAACTG C..
HERV-T47D-W5        AGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG C..
HERV-W1       TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W10      TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W11      TATGGGAGCC CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W18      TATGGAAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W2       TATGGGAGCT CTATTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W22      TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W23      TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W4       TATGGAAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W5       TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W6       TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA
HERV-W8       TATGGGAGCT CTGTTTTCAC TCTATTAAAT CTTGCAACTG CAA

C. HERV-K LTR sequences 

              1                                                   50
HERV-K45      GCGACCGGT: GGATC:CCGG GCCCGCGG:T ACCGTCGACT GCAGAATTC
HERV-K27      GCGACCGGT: GGATC:CCGG GCCCGCGG:T ACCGTCGACT GCAGAATTC
HERV-K2       GCGACCGGT: GGATC:CCGG GCCCGCGG:T ACCGTCGACT GCAGAATTC
HERV-K1       GCGACCGGT: GGATC:CCGG GCCCGCGG:T ACCGTCGACT GCAGAATTC
HERV-K30      GTC CCACCTCCAG CCCTAAGGCG GTTTTTCCCT ATCTCAGTAG
HERV-K10      AGTAG

              51                                                 100
HERV-K45      ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC
HERV-K27      ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC
HERV-K2       ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC
HERV-K1       ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC
HERV-K30      ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC
HERV-K10      ATGGAGCATA CAATCGGGTT TTATACCGAG ACATTCCATT GCCCAGGGAC

              101                                                150
HERV-K45      AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT
HERV-K27      AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT
HERV-K2       AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT
HERV-K1       AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT
HERV-K30      AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT
HERV-K10      AGGCAGGAGA CAGATGCCTT CCTCTTGTCT CAACTGCAAG AGGCATTCCT

              151                                                200
HERV-K45      TCCTCTTATA CTAATCCTCC TCAGCACAGA CCCTTTACGG GTGTCGGGCT
HERV-K27      TCCTCTTATA CTAATCCTCC TCAGCACAGA CCCTTTACGG GTGTCGGGCT
HERV-K2       TCCTCTTATA CTAATCCTCC TCAGCACAGA CCCTTTACGG GTGTCGGGCT
HERV-K1       TCCTCTTATA CTAATCCTCC TCAGCACAGA CCCTTTACGG GTGTCGGGCT
HERV-K30      TCCTCTTATA CTAATCCTCC TCAGCACAGA CCCTTTACGG GTGTCGGGCT
HERV-K10      TCCTCTTTTA CTAATCCTCC TCAGCACAGA CCCTTTACAG GTGTCGGGCT

              201                                                250
HERV-K45      GGGGGACGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC
HERV-K27      GGGGGACGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC
HERV-K2       GGGGGATGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC
HERV-K1       GGGGGACGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC
HERV-K30      GGGGGACGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC
HERV-K10      GGGGGACGGT CAGGTCTTTC CCTTCCCACG AGGCCATATT TCAGACTATC

              251                                                300
HERV-K45      ACATGGGGAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC
HERV-K27      ACATGGGGAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC
HERV-K2       ACATGGGAAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC
HERV-K1       ACATGGGGAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC
HERV-K30      ACATGGGGAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC
HERV-K10      ACATGGGGAG AAACCTTGGA CAATACCTGG CTTTCCTAGG CAGAGGTCCC

              301                                                350
HERV-K45      TGCGGCCTTC CGCAGTTTTT GTGT:CCTGG GTACTTGAGA TTAGGGAGTG
HERV-K27      TGCGGCCTTC CGCAGTTTTT GTGT:CCTGG GTACTTGAGA TTAGGGAGTG
HERV-K2       TGCGGCCTTC CGCAGTTTTT GTGT:CCTGG GTACTTGAGA TTAGGGAGTG
HERV-K1       TGCGGCCTTC CGCAGTTTTT GTGT:CCTGG GTACTTGAGA TTAGGGAGTG
HERV-K30      TGCGGCCTTC CGCAGTTTTT GTGTCC:TGG GTACTTGAGA TTAGGGAGTG
HERV-K10      TGCGGCCTTC TGCAGTTTTT GTGTCCCTGG GTACTTGAGA TTAGGGAGTG

              351                                                400
HERV-K45      GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAAGC
HERV-K27      GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAAGC
HERV-K2       GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAAGC
HERV-K1       GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAAGC
HERV-K30      GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAGGC
HERV-K10      GTGATGACTC TTAAGGAGCA TGCTGCCTTC AAGCATCTGT TTAACAAAGC

              401                                                450
HERV-K45      ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAC
HERV-K27      ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAC
HERV-K2       ACATCCTGCA CTGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCGCAC
HERV-K1       ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAC
HERV-K30      ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAC
HERV-K10      ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAT

              451                                                500
HERV-K45      GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC
HERV-K27      GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC
HERV-K2       GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC
HERV-K1       GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC
HERV-K30      GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC
HERV-K10      GTTTCAGAGA GCACGGGGTT GGGGGTAAGG TCATAGATTA ACAGAATCTC

              501                                                550
HERV-K45      AAGGCAGAAG AATTTTTCTT AACACATAAC AAAATGGAGT CTCCCATGTC
HERV-K27      AAGGCAGAAG AATTTTTCTT AACACATAAC AAAATGGAGT CTCCCATGTC
HERV-K2       AAGGCAGAAG AATTTTTCTT AACACATAAC AAAATGGAGT CTCCCATGTC
HERV-K1       AAGGCAGAAG AATTTTTCTT AACACATAAC AAAATGGAGT CTCCCATGTC
HERV-K30      AAGGCAGAAG AATTTTTCTT AACACATAAC AAAATGGAGT CTCCCATGTC
HERV-K10      AAGGCAGAAG AATTTTTCTT AGCACATAAC AAAATGGAGT CTCCTATGTC

              551                                                600
HERV-K45      TACTTCTTTC TACACAGACA CAGTAACAAT CTGATCTCTC TTGCTTTTCC
HERV-K27      TACTTCTTTC TACACAGACA CAGTAACAAT CTGATCTCTC TTGCTTTTCC
HERV-K2       TACTTCTTTC TACACAGACA CAGTAACAAT CTGATCTCTC TTGCTTTTCC
HERV-K1       TACTTCTTTC TACACAGACA CAGTAACAAT CTGATCTCTC TTGCTTTTCC
HERV-K30      TACTTCTTTC TACACAGACA CAGTAACAAT CTGATCCCTC TTGCTTTTCC
HERV-K10      TACTTCTTTC TACACAGACA CAGTAACAAT TTGATCTCTC TTGCTTTTCC

              601                                                650
HERV-K45      CCACATTTCC CCCTTTTCTT TTCG
HERV-K27      CCACATTTCC CCCTTTTCTT TTCGA
HERV-K2       CCACATTTCC CCCTTTTCTT TTCGACAAA
HERV-K1       CCACATTTCC CCCTTTTCTT TTCGACAAAA CCGCCAT:CT CGAGATC:TG
HERV-K30      CCACATTTCC CCCTTTTCTT ATCCATCACA CTGGCGGCCG CTCGAGCATG
HERV-K10      CCACATTTCC CCCTTTTCTT TTCGACAAAA CCGCCATC

              651
HERV-K1       AGT
HERV-K30      CATCTAGAGG GCCCAATTCG CCCTATAGTG
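
The homology values in Tab. 3 are percent identities between an analyzed LTR and a reference sequence. How they were computed is not stated in the source; the following minimal sketch assumes a simple column-wise comparison of two rows of an alignment, in which columns that are gaps in both rows are skipped and a gap opposite a base counts as a mismatch. The function name and the gap convention are assumptions made for illustration.

```python
def percent_identity(a: str, b: str, gap_chars: str = ".:-") -> float:
    """Percent identity of two aligned rows (spaces between blocks ignored)."""
    a = a.replace(" ", "").upper()
    b = b.replace(" ", "").upper()
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x in gap_chars and y in gap_chars:
            continue  # column is empty in both rows, so it carries no information
        compared += 1
        if x == y and x not in gap_chars:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# Positions 401-450 of HERV-K45 and HERV-K10 from the alignment above.
k45 = "ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAC"
k10 = "ACATCCTGCA CCGCCCTTAA TCCATTCAAC CCTGAGTTGA CACAGCACAT"
print(percent_identity(k45, k10))
```

The two rows differ only at the last position, so 49 of 50 columns agree. Values computed over a single 50-nt block are not comparable with the whole-LTR figures in Tab. 3.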

HERV-K-T47D-5' LTR



TGTGGGCGAAGGATTACCCAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATATAGAAAATAGCTAG 
AATAAGAATAGTTATAATAAAAATTAGATATACACATGATCATGGACATTACCAATCATTACTACAAACATTGTTAATCA 
TTAGCTTTTAATATTACTCTTTGTTTTATTACTAATATAACCAAGGAATAACCGGTAGCATACGGTCAGGTGCTGAAGGG 
ACATTGTGAGAAGTGACCTAGAAGGCAAGAGGTGAGCCTTCTGTCACGCCTGCATAAGGACAGCTTGAGGGCTCCTTGGT 
CAAGCTGTAACACCAGTGCCTGGGAAGGCACCGTTACTTAGCAGACCATGAAAGGGAGTCTCCATTCCTTGGAGGAGTCA 
GGGAAACACTATGCTCCACCAGCTTCTTGTGTATCCAGCCCTGCCCACAGTCATCCAGAGGCATAAACCCCTCCCTGTGG 
TGCTGTGCTTCAATGGCCATGCTTCTTGTCCACTTTCATGTTCCTCCTGTACTCCTGGTTCCTCTTTGAAGTTCGTAGAA 
GATAATGGTAGAAGAAATAGTGAAAGTCTTTGATCTTTCTTATAAGTGCATAGAAGAAAACACTGATGTATGCCTGCCTT 
CCCTCTCTGCTTCAGCTACCTAAAAGGAAAGGCCCCCTTTCCCATGATCACATGACTTGCCTGACCTTATCAATCACTTG 
GAGGACTCACCCTCCTTACCCTGTCCCTTTGTCTTGTATGCAATAAATATCAGCACGCCCAGCCATTCGGGGCCACTACT 
GGTCTCCGCAACTTGGTGGTAGTGGTACCCTGGGCCCAGCTGTTTTCTCTTTATCTCTTTTGTCTTGTGTCTTTATTTCT 
TACAATCTCTCATCTCTGCACATGGGGAGAACACCGGCAAAGCCCGTAGGGCTGGACCTTACA 

L48-LTR (U3-R) 

TGTGGGCGGAAGAGTACCTAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAGAAAATAGAATA 
AGAATAGTCATAATACAAATTAGATACAGCGATGATCATGAACAATTATCCATCATTATTATAAACATTATTAATCATTA 
GCTTTTAATATTACTCTGTTGCATTAATAATATAACCTAGGAATAACCGGCAGGTATAGGGTCAGGTGCTGAAGGGACAT 
TGTGAGAAGTGAATAGAAGGCAAGAGGGGAGCCTTCTGTCATGCCCGCATAAGGGCCGCTTGAGGGCCCCTTGGTCAAGC 
GGTAACGCCAGTGTCTGGGAAGGCACCCGTTACTGAGCAGACCGGGAAAGGGAGTCTCCTTTCCTTGGAGGAGTCAGGGA 
ACGCTCTGCTCCACCAGCTTCTTGTGGGAGGCTGGATGTTACCCAGGCCTGCCTGCAGTCATCCGGAGGCCTGAACCCCT 
CCCTGTGGTGCTTCAATGGTCACGTTCCTTGTCCACTTTCATGCTCCTTCCGTACTCCTGGTTCCTCTTTGAAGTTCGTA 
GTAGATAGCGGTAGAAGAAATAGTGAAAGTCTTAAAGTCTTTGATCTTATAAGTTCATAGAAGAAAACGCTGATGCCTGC 
CGCCTTCTCTCTCTGCTTCAGCTACCTAAGAGGGAAGGGCCCGCTGTCCTGTGATCAGGTGACTTGCTTCACCTTGTCAA 
TCACTTAGAAGACTGACCCTCCTTATCCTGCCCCCTTGTCTTGTATGCAATAAATATCAGCGAGCCCAGCCGTTCAGGGC 
CACTACCGGTCTCCGTGTCTTTGTGGTAGTGGTCCCCGGGCCCAGCTGTTTTCTCTTT 

L5-LTR (U3-R) 

TGTGGGTGGAGGATTACCCAGGTGCCAAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAAAAAATAGAATA 
AGAATAGTCATAATACAAATTAGATATAGAGATGATCATGGACAATTAGCAATCACTATTAATCTTTAGCTTTTAATATT 
ACTCTTTGTTGCATTACTAATATAACCTAGGAATAACCGGTGGGTATAGGGTCAGGTGCTGAAGGGACATTGTGTGAAGT 
GACCTGGAAGGCAAGAGGTGAGCCCTCTGTCACGCCCACATAAGGGCCGCTTGAGGGCTCCTTGGTCAAGTGGTAACGCC 
AGTGTCTGGGAATGCACCCGTTAATTAGCAGACCGCGAAAGGGAGTCTCCTTTCCTTGGAAGAGTTGGGGAACACTCTGC 
TCCACCAGCTTCTTGTGGAAGGCTGGATATTATCCAGGCCTGCGCGCAGTCATCCGGAGGCTTAAACCCTTCCCTGTGGT 
GCTGTGCTTCAATGGTCCCACTCCTTGTCCACTTTCATGCTCCTCCCGTACTCCTGGTTCCTCTTTGAAGAGCGCAGTAG 
ATAGCGGTAGAAGAAATAGTGAAAGTCTTAAAGTCTTCGATCTTTCTTACAAGTGCAGAGAAGAAAACGCTGACATATGC 
TGCCTTCCCTCTCTGCTTCGGCTACCTAAAAGGGAAGGGCCGCCTATCCTGTAATCACATGACTTGCTTCACCTTGTCAA 
TCACTTAGAAGATTCACTCTCCTTACCCTGCCCCCTTGTCTTGTATGCAATAAATATCAGTGACCCCAGCCGTTCAGGGC 
CACTACTGGTCTCCGCGTCTTGATGGTAGTGGTCACCCCGGCCC 



L50-LTR (U3-R) 

TGTGGGTGGAGGATTACCCAGGTGCCAAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAAAAAATAGAATA 
AGAATAGTCATAATACAAATTAGATATAGAGATGATCATGGACAATTAGCAATCACTATTAATCTTTAGCTTTTAATATT 
ACTCTTTGTTGCATTACTAATATAACCTAGGAATAACCGGTGGGTATAGGGTCAGGTGCTGAAGGGACATTGTGAGAAGT 
GACCTGGAAGGCAAGAGGTGAGCCCTCTGTCACGCCCACATAAGGGCCGCTTGAGGGCTCCTTGGTCAAGTGGTAACGCC 
AGTGTCTGGGAATGCACCCGTTAATTAGCAGACCGCGAAAGGGAGTCTCCTTTCCTTGGAAGAGTTGGGGAACACTCTGC 
TCCACCAGCTTCTTGTGGAAGGCTGGATATTATCCAGGCCTGCGCGCAGTCATCCGGAGGCTTAAACCCTTCCCTGTGGT 
GCTGTGCTTCAATGGTCCCACTCCTTGTCCACTTTCATGCTCCTCCCGTACTCCTGGTTCCTCTTTGAAGAGCGCAGTAG 
ATAGCGGTAGAAGAAATAGTGAAAGTCTTAAAGTCTTCGATCTTTCTTACAAGTGCAGAGAAGAAAACGCTGACATATGC 
TGCCTTCCCTCTCTGCTTCGGCTACCTAAAAGGGAAGGGCCGCCTATCCTGTAATCACATGACTTGCTTCACCTTGTCAA 
TCACTTAGAAGATTCACCCTCCTTACCCTGCCCCCTTGTCTTGTATGCAATAAATATCAGTGACCCCAGCCGTTCAGGGC 
CACTACTGGTCTCCGCGTCTTGATGGTAGTGGTCACCCCGGCCCAGGTGTTTTTTCTTT 

L9-LTR (966 nt)

TGTGGGTGGAGGATTACCCAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAGAAAATGGTTAG 
AATAAGAATAGTCATAATACAAATTAGATATAGAGATGATCATGGACAATTATCAATCATTATTATAAACATTATTAATC 
ATTAGCTTTTAATATTACTCTTTGTTGCATTACTAATATAACCTAGGAATAACCGGTGGGTATAGGGTCAGGTGCTGAAA 
GGACATTGGGAGAAGTGACCTAGAAGGCAAGAGGTGAGTCTTCTGTCACGCCCGCATAAGGGTTGCTTGAGGGCTCCTTG 
GTCAAGTGGTAACGCCGGTGTCTGGGAAGGCACCTGTTACTTAGCCGACCACGAAAGGGAGTCTCCTTTCCTTGGAGGAG 
TCAGGGCGCACTCTGCTCCACCAGCTTCTTGTGGAAGGCTGGATATTATCCAGGCCTGCCCGCAGTCATCCGGAGGCCTA 
AACCCCTCCCTGTGGTGCTGTGCTTCAATGGGCACACTCCTCGTCCACTTTCATGTTCCTCCCATACTCCTGGTTTCTCT 
TTGAAGTTCGTAGTAGATAGTGGTAGAAGGAATAGGGAAAATCTTAAAGTGTTTGATCTTTCTTATAAGTGCATAGAAGA 
AAACGCTGACATATGCTGCCTTCTCTGTCTGCTTCAGCTACCTAAGAGGGAAGGGCCCCCTGTCCAGTGATCACGTGACT 
TGCTTCACCTTGTCAATCACTTAGAAGATTCACCCTCCTTACCCTGCCCCCTTGTCTTGTATGCAATAAATATCAGTGCA 
CCCAGCCTTTCGGGGCCACTTACCGGTCTCCACGTCTTGGTGGTAGTGGTCCCCCGGGCCCAGCTGTTTTCTCTTTATCT 
CTTTGTCTTGTGTCTTATTTATTACAATCTCTCGTCTCCGCACACAGGGAGAACACCCGCTAAGCTCCGTAGGGCTGGAC 
CCTACA 



L8-LTR (938 nt)

TGTGGGTGGAGGATTACCCAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAGAAAATGGTTAG 

AATAAGAATAGTCATAATACAAATTAGATATAGAGATGATCATGGACAATTATCAATCATTATTATAAACATTATTAATC 

ATTAGCTTTTAATATTACTCTTTGTTGCATTACTAATATAACCTAGGAATAACCGGTGGGTATAGGGTCAGGTGCTGAAG 

GGACATTGGGAGAAGTGACCTAGAAGGCAAGAGGTGAGTCTTCTGTCACGCCCGCATAAGGGTTGCTTGAGGGCTCCTTG 

GTCAAGTGGTAACGCCGGTGTCTGGGAAGGCACCTGTTACTTAGCCGACCACGAAAGGGAGTCTCCTTTCCTTGGAGGAG 

TCAGGGCACACTCTGCTCCACCAGCTTCTTGTGGAAGGCTGGATATTATCCAGGCCTGCCCGCAGTCATCCGGAGGCCTA 

AACCCCTCCCTGTGGTGCTGTGCTTCAATGGGCACACTCCTCGTCCACTTTCATGTTCCTCCCATACTCCTGGTTCCTCT 

TTGAAGTTCGTAGTAGATAGTGGTAGAAGGAATAGGGAAAATCTTAAAGTGTTTGATCTTTCTTATAAGTGCATAGAAGA 

AAACGCTGACATATGCTGCCTTCTCTGTCTGCTTCAGCTACCTAAGAGGGAAGGGCCCCCTGTCCAGTGATCACGTGACT 

TGCTTCACCTTGTCAATCACTTAGAAGATTCACCCTCCTTACCCTGCCCCCTTGTCTTGTATGCAATAAATATCAG.TGCA 

CCCAGCCTTTCGGKKCACTTACCGGTCTCCACGTCTTGGTGGTAGTGGTCCCCCGGCCCAGCTGTTTTCTCTTTATCTCT 

TTGTCTTGTGTCTTATTTATTACAATCTCTCGTCTCCGCACACAGGGAGAACACCCGC (truncated 26 nt before the end of the LTR)



L49-LTR = L20-LTR (963 nt) 

TGTGGGCGAAAGATTACCTAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGTATAATAAAGAAAATAGTTAA 
AATAAGAATAGTTATAATACAAATTAGATATAGAGATGATCATGGACAATTATCAATCATTATTATAAACATTAATCATT 
AGCTTTTAATATTACTCTTTGTTGCTTTACTAATATAACCTAGGAATAACCGGTGGGTATAGGGTCAGGTGTTGACGGGA 
TATTGTGAGAAGTGACCTAGAAGGCAAGAGGTGAGCCTTCTGTCACGCCCACATAAGGGCCGCTTGAGGGCTCTTTGGTC 
AAGTGGTAACGCCAGTGTCTGTGAAGGCACCTGTTACTTAGCAGACCGCGAAAGGGAGTCTCCTTTCCTTGGAGGAGTCA 
GGGAACACTCTGCTCCACCAGCTTCTTGTGGAAGGCTGGATATTATCTAGGCCTGCCCGCAGTCATCTGGAGGCCTAAAC 
CCCTCCCTGTGGTGCTGTGCTTCAGTGGTCACTCTCCTTGTCCACTTTCATGTTCCTCCCGTACTCCTGGTTCCTCTTTG 
AAGTTCGTAGTAGATAGCAGTAGAAGAAATAGTGAAAGTCTTAAAGTATTTGATCTTTCTTATAAGTGCATAGAAGAAAA 
CGCTGACATATGCTGCCTTCTCTATCTCTGCGGTGGCTACCTAAAAGGGAAGGGCCCCCTGTCCCATGATCATGTGACTT 
GCTTCACCTTATCACTTAGAAGATTCATCCTCCTTACCCTGCGCCCCCTCGTCTTGTATGCAATAAATATCAGCACGCCC 
AGTCGTTTGAGGCCACTGCCGGTCTCCGCGTCTTGGTGGTAGTGGTCCCCCGGGCCCAGCTATTGTCTCTTTATCTCTTT 
GTCTTGTGTCTTTATTTATTACAATCTCTTGTCTCTGCACACAGGGAGAACACCTGCTAAGCCCCGTAGGACTGGACCCT 
ACA 



HERV-IP-T47D 

TGTTCAATTCTTTGCCTTCTACTTTTAAACTTAACTTCCTCATAAAGCAACCTTTTTCAATCACCTGCTCCACTCTGACT 
CATTCTGATCACCTGCTCCACCCTGACTCATTCCGATCACCTGATCCACTGTGACTCATTCCGATTACCCGCTCCACCCT 
GACTCATTCTGATTCTGATTTCCTGCTCTGCCATAACCATTTTTCCCGCCAAACCACTCACCCTGTCACTCTCTTTAAAT 
TAGCCAATTGGAATTAGTTTAGCCTGTGCGGTCTAACCCTAGCCAATAGGGGACTGACACAGCAGCAGGGGCCACATGTG 
TCAGGAATAAGACCCCCTTCCCCTCCCTGTCCAGATGTGTGCTCACCATTGCTCCATCTGTGAGGGCACACCCTTCTATA 
GAAGTAAATTGCCTTGCTGAGAAGAAAAAAAAGAACATTTTATATTCAAGTCCTATTTCTTTTGCTGCACCGAAACTTTA 
TTTATAACA 
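
Some of the LTR listings above state their length in the heading, for example "L9-LTR (966 nt)", while the sequence itself is wrapped over many lines. The following is a small sketch, assuming each listing is a heading line followed by wrapped sequence lines, for checking a declared length against the letters actually present. The function names are hypothetical and the check is not part of the original work.

```python
import re


def declared_length(heading):
    """Extract the length from a heading such as 'L9-LTR (966 nt)', or None."""
    match = re.search(r"\((\d+)\s*nt\s*\)", heading)
    return int(match.group(1)) if match else None


def sequence_length(lines):
    """Count sequence letters over wrapped lines, ignoring spaces and gap dots."""
    return sum(len(re.sub(r"[^A-Za-z]", "", line)) for line in lines)


def length_matches(heading, lines):
    """True if the heading declares a length and the lines add up to it."""
    expected = declared_length(heading)
    return expected is not None and expected == sequence_length(lines)


# First and last line of the L9-LTR listing; the full check would pass all lines.
first_line = ("TGTGGGTGGAGGATTACCCAGGTGCCGAGGCAAGAGACTGAAGGCACAAACTGTTTCAGT"
              "ATAATAAAGAAAATGGTTAG")
last_line = "CCTACA"
print(declared_length("L9-LTR (966 nt)"))
print(sequence_length([first_line, last_line]))
```

If each of the twelve full lines of the L9-LTR listing holds 80 nt, as the first one does, the twelve lines plus the six-letter tail add up to the declared 966 nt.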



